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The eff,€cts of vocabulary instruction cn word 
knowledge and reading comprehension were assessed in three . 
experiments. . In experiment one, employing 12 "average" fourth grade 
readers, and' experiment two, employing six learning disabled 
intermediate level students, word, synonyms were taught to pairs of 
students on three consecutive days. Experiment three, employing ten 
fourth grade disaiied readers, examined the effect of two to four 
sessions of vocabulary instruction on comprehension of connected 
discourse. The three treatments used varied in the amount of direct 
instruction, ranging from permitting students to infer meanings from 
context to drilling students on synonyms. Results indicated that the 
treatments, were differentially effective in teaching word synonyms, 
that with increases in direct instruction more meanings were 
acquired, and that the treatments produced differential transfer to 
sentence comprehension^* Average students learned some word synonyms 
under all conditions except a noninstructional control condition; 
learning disabled students required more direct instruction to 
produce any learning. Although vocabulary training was found to 
transfer to comprehension of single sentences, on two of three 
measures of passage comprehension no effects were observed that could 
be attributed to vocabulary instruction. (GW) 
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Vocabulary and Reading Comprehension: Instructional Effects 

Conventional analyses of reading usually include at least three com- 
ponents (Gol i nkof f , 1 975) : decodi ng (word readi ng) , lexical access (re- • 
trieving word meanings), and text organization (constructing meanings from 
units larger than words). The present research focuses on the second two 
components/ individual word meaning, and meaning construction from connected 
discourse. 

As has been noted by Otto, McMenemy, and Brown, "Conseinsus regarding 
the need for systematic and extensive vocabulary development is unanimous 
among authorities in reading instruction; and the need is recognized too 
by teachers who work with disabled readers" (1973, p. 185). On the empiri- 
cal side, there is correlational evidence that implicates vocabulary in 
the reading comprehension process. Readability research and factor analysis 
studies of reading comprehension are cases in point. In their examination 
of readability, Chall (1958) and more '"ecently Klare (197^) reported that 
all quantitative investigations as well as surveys of readers' and. experts' 
opinions showed vocabulary to be related in some degree to the difficulty 
of reading materials. The single best predictor of readability was '^ome 
measure of vocabulary load, typically measured by comparing words in a 
selection of text to word frequency, lists (e.g., the Lorge-Thornd i ke 
30,000 most common words or the Dale list of 3,000 familiar wor<;ls), or by 
computing word length which is, itself, highly relied to word frequency. 
Thus one hk ,ure of passage difficulty ' Ik p )portion of infrequent 
and, (presumably, unfami 1 iar words. \ 
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Factor analysis studies conducted to identify the component sub-skills 
of reading comprehension also lend support to the importance of vocabulary. 
Davis (19^^, 1968) constructed a reading test specifically designed to 
measure nine presumably distinct comprehension skills. His results revealed 
five significant factors, one of which was knowledge of word meanings. In 
fact, Davis and subsequently others (Spearritt, 1972; Thorndike, Note 1) 
reported that word knowledge and reasoning in reading accounted for virtually 
all of the variance ?n comprehension scores. 

The most direct evidence of an experimental nature derives from a 
study by Marks, Doctorow, and WIttrock (197^). They randomly assigned two * 
versions of reading passages to sixth grade students. Fifteen percent of 
the vocabulary in each passage was manipulated to produce two versions, one 
with hfgh frequency words and one with low frequency words. Compared with 
students reading the low frequency versions, those who read the high 
frequency versions scored s i gn i f i can tl y: better (around 25%) on comprehen- 
sion questions. Marks et al. attributed this result to differences in 
the students' knowledge of the high and low frequency vocabularies. They 
also replicated this finding in a subsequent experiment using the same 
materials (Wi ttroc^i'^^Marks F, Doctorow, 1975). 

On the practical side, publishers of standardized reading achievement 
tests have highlighted the importance of vocabulary. On such popular 
tests as the Stanford Ach ievement Test (1970) and the Metropol i tan 
Ac hieve r of r^-^-.t (1 970) ,. read i ng comprehensic I martially evaluated 
by ' .cabulary knowledge ana / questions contained 

within other subtests (e.g., paragraph meaning) loreover, authors of 
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textbooks on the teaching of reading invariably delineate instructional 
procedures for improving vocabulary, and urge teachers to undertake this 
type of instruction as a means of enhancing students' reading comprehension 
(Harris, & Sipay, 1975; and Tinker & McCul lough, 1962). For example, Spache 
and Spache stated that "Understanding the vocabulary is second only to the 
factor of reasoning in the process of comprehension, and some writers would 
say that it is even more important than reasoning ... It is sufficient 
to say that comprehension is significantly promoted by attention to vocabu- 
lary growth" (1973, p. 78). 

The importance of vocabulary is also stressed by the authors and 
publishers of basal readers. Lessons in the teacher manuals regularly pin- 
point^words for vocabulary instruction. Student workbooks provide a variety 
of additional exercises related to teaching word meanings. 

The literature on vocabulary instruction and its relation to /eading 
comjprehension is largely descriptive; relatively few training experiments 
have been conducted which investigate either the absolute or the relative 
effects of various vocabulary teaching procedures. Furthermore, no investi- 
gations have been publ i shed which demonstrate that teaching vocabulary to 
children affects their reading comprehension.. Indeed, the few experiments 
on this issue have found no effects on comprehension, whether i t^ is measured 
genera] ly wi th standardized tests (Jackson 6 Dizney, 1973; Lieberman, 1967) 
or measured specifically by questions based on passages which included the 
instructed vocabulary (Pany & Jenkins, 1978).. 

Against this background, the present set of experiments was designed. 
One intent of the researches was to evaluate teaching procedures frequently 
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employed to improve vocabulary. Another intent wasf>?to determine the gener- 
ality of these procedures across types of learners (average vs. remedial). 
A third intent was to examine the effects of various vocabulary instruc- 
tion procedures on comprehension of sentiences and stories that contained 
the taught vocabulary. 

For the first experiment three conditions were devised which varied in 
the amount of direct instruction provided on word meanings. One condition, 
Meanings from Context, provided the least direct instruction, though it 
did provide students with the opportunity to Infer word meanings from a 
specially prepared, "rich" context- The contexts contained synonyms for 
the unfamiliar words which were embedded in sentences describing a familiar 
situation. In a second condition. Meanings Given, instruction was more 
direct; the teacher stated a synonym for each unfamiliar wc: u and provided 
a sample sentence which used the target word and synonym to describe a 
familiar situation. The third experimental condition. Meanings Practiced, 
involved the heaviest emphasis on direct instruction of word meanings. 
Meanings of unfamiliar words were stated and students were drilled on the 
words until they mastered them. Example sentences were provided in this 
condition, as well. It was anticipated that the three instructional con- 
ditions would differ not only in their effectiveness in teaching word 
meanings but also^in the extent to which they would influence eherr 
sion of sentences which contained the taught words. Practlcifj word 
meanings as compared to being told word meanings was expected to have 
greatei effects on all measures, and both of these procedures were expec- 
ted to be superior to learning word meanings through context clues alone. 
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EXPERIMENT 1 
Method 

Subjects 

Twelve fourth grade students, eight females and four males, ages 9-10, 
participated in the study. The students came from two classrooms and were 
selected according to their performance on a vocabulary screen! ng test , 
which- is described below. To participate in the experiment students had 
to read orally at least 75% of the experimental pretest items without error, 
but correctly identify no more than 10% of the word meaning<^n the initial 
screening test. The students had taken the Metropolitan Achievement Tesl 
six months prior to the study. Their Word Knowledge subtest scores ranged 
from 2.9 to 6.7 with a mean of 5-0, and their Reading subtest scores ranged 
from 2.9 to 8.0 with a mean of k.k. The Economy Keys to Reading program 
(1972) and the Macmillan Reading Program ( 1 970) were used in the two class- 
rooms ?n a traditional group instruction format. At the time of this study, 
all students were reading in the fourth grade level texts or above. Two 
students had been placed in a high reading ;group, , s I x i n a middle reading 
group, and four in a low reading group by their classroom teachers. None 
of these students were receiving remedial instruction outside their regular 
'classroom program. • 

Selection of Vocabulary 

Sixty-five words that the experimenters judged would be unfamiliar 
to many fourth grade students were drawn from several fourth grade reading 
texts--Macmi 1 Ian (1970), Economy (l 972) , Amer i can Book Company (1972), 
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Houghton Mifflin (1976), Lippincott (1975), Holt, Rinehart and Winston 
(1973), and Scott Foresman (1973). With one exception, the words selected 
were phonetically regular, and were definable by a more familiar one- or 
two-word synonym. Except for one word which was familiar to 76% of sixth 
grade students, all synonyms were judged to be familiar to 69% to 33% of 
fourth grade students according to Dale and O^ftourke (1976). 

A 65 item multiple choice vocabulary test was constructed in which 
each target word pr^eded four randomly ordered choices: a correct 
synonym, a randomly selected synonym of another item, and two distractors 
V/hich had approximately the same frequency of occurrence in elementary 
school readers as did the correct synonym .(Carrol 1 , Davis & Richman, 1971). 
All choices for each item were the same part of speech. This multiple 
choice test was administered as a pilot to 10 third and 30 fourth grade 
students in their classrooms. Every item was read aloud twice by an 
experimenter as students read along and circled one of the choices. 

From these data, the kO most frequently missed items were used to 
construct a multiple choice screening test for the experiment. The items 
were revised so that the synonym ^stractors were drawn from this reduced 

!f,V .... 

^* pool of kO words. This screening test was orally administered to all 
students in the two fourth grade classes. Experimental students were 
then identified according to the previously stated criteria of ]0% or 
less correct. A final set of 2k target words was selected. No more than 
two students who were chosen for the study had selected correct synonyms 
for any single target word. The target words consisted of seven nouns, 
eight verbs, and nine adjectives. 

o 



" ^ Vocabulary and Read?ng Comprehension 

8 

Design 

An Incomplete randomized block design was used in which students 
served as their own controls and participated in all experimental ccyndi- 
tions. For six randomly paired students from one classroom, the 2k target 
words were randomly divided into three groups of eight words each. Word 
groups were assigned to pairs of subjects and to instructional days , 
according to a randomized Latin Sc^uare arrangement. Within each 8-word 
group, two words were randomly assigned to each of the fou*!* e;sper Imenta 1 
r iditions. All students received each trea||ient, with treatment order 
randomized across pairs of students. For the six students from the second 
classroom, all randomization was repeated to yield a different set o^ 
materials and order of presentation. 

Treatment Conditions ^: 

In each of the four conditions, two typed words and/or sentences were 
presented on .076 x .127m index cards. Students read each card si'iently, 
then orally, then a^ain silently; Certain treatments required additional 
experimental procedures; these always occurred Ydl lowing students* oral 
responses. The treatments are described below. 

Meanings from Context . No direct instruction was provided on word 
meanings. Instead, students read two sentences, the first containing a 
target word. The second sentence was related-feb the first and contained 
a synonym of the target word. For* example. 

Student reads: "Dan is a real buffoon. He is the 

funniest clown in the circus." 



^^^^ " 
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Meanings, given . Students read a sentence containing a target word. 
Next, the experimenter stated botljj the meaning of the target word and a 
sample sentence using the word as the child might^jhear it sn his daily 
experience. For example, 

Studrnt reads: "Dan is a real buffoon." 

Experimenter says: "Buffoon means clown. Teachers do not 

like the^ir students to behave like 
buffoons or clowns in school." 
"Read the sentence again to yourself." 

Meanings Practiced . Students read a single target word. The experi 
manter stated a synonym and a sample sentence using the target word. Stu 
dents then repeated the target word and the synonym. For example, 

Student reads: "buffoon" 

Experimenter says: "Buffoon means clown. Your teacher may 

become angry if you behave like a buffoon 
in class." 

"What does buffoon mean?" 
Student 1 says: "Buffoon means clown." 

Experimenter says: "What does buffoon mean?" 
Student 2 says: "Buffoon means clown." 

In this condition, two additional words were presented with the target 
words so as to increase the task d I f f i cul ty. and to i'nsure that students 
attended to eacfi word. Students were not told that they would be tested 
on only the two targ-^t words. When all four words had been presented, 

ID 
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they were reviewed and the index cards were shuffled^ Then, the experlr 
menter presented all four cards, one at a time. One student read the 
word and attempted to state its meaning. The exper Imi^nter supplied 
corrective feedback when necessary. This procedure continued until* the 
student had given correct*, mean I ngs for a.ll. four target words on three 
consecutive trials. The experimenter then repeated this procedure for 
the second student. 

-j^o Meanings Control . Students read the target words, each printed 
singly on an index card. For example. 

Student reads: "buffoon" . * 

Dependent Measures 

In all, fo^ur separate measureiment instruments were constructed two 
assessed vocabulary knowledge anxl two assessed sentence comprehension. , 
The first measure was an Isolated Word Vocabulary Test which Consisted of 
the target words typed in a single column. The student read each word 
orally and gave its synonym; Responses were recorded by an experimenter. 
The second vocabulary knowledge measure, the Multiple Choice Vocabulary 
Test, contained the 2k target words each followed by four randomly arranged 
choices. The choices were constructed in the same manner as the screening 
test. Students silently read each item and circled their answer. 

In addition, two types of sentence comprehension were tested. For 
the Sentence Peraphrase Test, a novel sentence was constructed for each 
target word; e.g., "I think his talk was rational." Students orally read 
each sentence and attempted to restate the sentence without using the- 
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target word. AVI responses were recorded. The second sentence comprehen- 
sion measure was a 96 item Sentence Anomaly Test. Four sentences vyere 
generated for each target woi"d so that at least one sentence per target 
word made sense, i.e., the target Word was syntactically and semantical ly 
appropriate. At least one sentence did hot make sense, i.e., the target 
. word was syntacti cal ly i ncorrect . for example, ''Mother put the toys in 
the garret'* was a sensTtT^ sentence-, and "The garret grew up" was an 
^XanomalQus sentence. .The 96 sentences were printed in random order. Stu- 
^nts silently read each sentence and marked a'plus. (+) if it made sense, 
a minus (-) if it did not, and a zero (0)jf they did not know. Guessing 
was discouraged. Students received training on the task with non-target 
items prior to testing. 

. Scoring and reliability . Each item on both measures -of word knowledge 
was worth one poi,nt. For the Sentence Paraphrase Test one point was given 
for each answer considered to be correct by at least two bf three scorers • 
For the Sentence Ariomaly Test, a student had to correctly (designate at 
least three of the fbur sentences per ^target word to be awarded one point. 
All tests were scored, independently by three people. Agreement by two 
of the three was necessary on each item. The three scorers agreed on more 
than 99^ of all test items. 

Procedure • 

Fjor two consecutive days priot to instruction, stL^dents came to t 
e^xpfer imenta 1 room and completed four pretests. On the first day . 

f-esting, each student completed the Isolated Word Vocabul^ry^ Test and 
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the Sentence Paraphrase Test^^with an^experlmenter. Students were also 
able to complete approximately half of the Sentence Anomaly Test inde- 
pendently. The remaining half of this test was administered on the second 
testing day, and was followed by the^-Mul t i pie Choice Vocabulary Test. 
Students were instructed to ask* the experimenter to supply any words they 
could not read. As the students worked on the Sentence Anomaly and ^ 
Multiple Choice Tests, experimenters randomly selected test items for 
Individual students to read orally.. Each student's "^correct oral reading 
responses, combined across ^all oral tests and orally sampled items of 

the wri.tten t3Sts, were recorded and computed as a measure of oral reading 
1 . IS 

accuracy. 

The treatment cond! t ions, were introduced one day after completion 
of the pretest ing. Students came to the experimental room in pairs for 
three Qonsecutive days. Each experimenter tauqht a different 8-word set 
to a different pair of students each day.^ Two words were presented in 
each of the four treatment conditions every day. Treatments were randomly 
ordered for each student pair. Time required for the treatment was 
approximately ^5 seconds for Context, 65 seconds for Given, 6 1/2 minutes 
for Practiced, and 15 seconds for the Control. After three days all Ik 
words had been presented to each student pal r. 

Posttests were administered daily followihg theT four treatments. The 
posttest items were identical to those used in the pretest, however the 
daily posttests included only those' items associated with that day's 
eight target wordSi Students were tested individually: one member of 
each pair began the Sentence Anomaly^Test while the experimenter recorded 
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the other student's oral responses to the Isolated Word and the Sentence 

Paraphrase tests. The experimenter then tested the first student while 

! 

the second student wrote answers to the Sentence Anomaly Test. Studer^ts 
completed the Multiple Choice Vocabulary Test last. Throughout each 
session, students were awarded points for their participation. They then 
exchanged their points for small tangible reinforcers at the end of each 
dai ly session. 

To assess retention effects, all students were retested on the two 
vocabufary tests two weeks after the experiment. 

Results 

A total score was calculated for each student (N = 12) under each 
treatment for each of the four immediate tests and the^ two delayed tests. 
Scores for ind i vi due 1 . s tridents ranged from 0 to 6 out of a possible total 
of 6, i.e., the n'jmber of words in each treatmenO:. Both pre and posttest 
scores were analyzed in a repeated measures randomized block factorial 
design. Separate two-way analyses of variance, 2(Tests — pre and post) 
X A(Treatments) , were performed on each dependent variable. Posttest 
means and standard deviations are displayed in Table 1. 

Insert Tg^le 1 about here 

Significant overall Test , £( 1 , 77) > 128.3^, £< .01, Treatment, 
£(3,77) > 15.30, £< .01, and Test by 'Treatment interaction, F_(3,77) > 
21.^5, £ < .01, were observed on the four dependent measures given 
Immediately after training. On the two delayed measures, analyses 
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revealed significant Test, £(1,77) > 33.21, £< .01, and Test by Treatment 
Interactions, F(3,77) > 2.76, £< .05. 

Tests foKsimple effects revealed the following. There were no sig- 
nificant pretest differences among Treatment groups, £(3,77) < 1.38, 
£> .05, for any dependent vari able. In contrast, there were significant 
posttest differences amohg the treatments on each dependent variable, 
F(3,77) > 4.83, £ < .01: Significant pre to posttest differences were not 
observed for the Control condition on any dependent measure, £(1,77) < 1.12, 
£> .05. In general, pre to posttest changes were significant, £(l ,77) '> 
7.86, £ < .01, for all experimental groups on each dependent variable. 
Exceptions were the two delayed vocabuj^ry tests where pre ^to posttest 
differences for the context treatment diminished in size. Specifically, 
there was no difference on the Isolated Word-Delayed Test, £(l,77) = 3.52, 
£> .05, and a difference on the Multiple Choice-Delayed Test, £(3,77) = 
4.76, £ < .05. 

Tukey HSD contrasts were performed to^ determine which. postAest means 
differed significantly. In general. Practice means were significantly 
higher, £ < .01, than Gfven means. Exceptions occurred on the Multiple 
Choice-Delayed Test where practice exceeded Given at only the .05 level 
of confidence, and on the'^ I sol ated Word-Delayed measure, where the 
aiffferences did not reach significance. .With one exception. Practice and 
Given means exceeded the means of both the Context and Control conditions, 
£< .01. Oh the Multiple Choice-Delayed Test, Given means did not differ 
-significantly from either Cpntext or Control, £ > .05. Means of the Context 
■treatment exceeded means of the Control on the Parajihrase measure, £ < .05, 
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and on the Isolated Word measure, £< .05. Context and Control means did 
not differ significantly on any other measure. 

Pi scuss i on 

With the few exceptions enumerated above, the treatments were similarly 
ordered in f^ffectiyeness across all dependent measures, with Practice means 
exceeding Given, which exceeded Context, which In turn exceeded the Control 
condition. This ordering further reflects the amount of direct instruction 
provided by a teacher; practice of wor'd meanings involved the greatest amount 
of direct instruction, giving meanings required less direct instruction, 
and reading meanings in context required even less teacher-directed instruc- 
tion. Although the pattern of instructional effects was similar for immediate 
and delayed tests, the instructional conditions that produced weaker effects 
on immediate measures were sometimes not differentially effective on the 
delayed measures. This was especially true when word meanings were acquired 
through context, in which case performance on delayed tests failed to indi- 
cate retention of any meanings. However, the Pract i ce cond i t i oh which pro- 
duced the greatest effects on immediate measures also yielded the best 
retention. 

The results indicated that the procedures which were differentially 
effective in teaching the meanings of single words were a 1 so d i f f erent i al 1 y 
effective in producing transfer to sentence comprehension. Students best 
comprehended the meaning of sentences that contained words taught in the 
Practice condi tion. They comprehended sentences least well when the sen- 
tences conta i ned wgrds that had been taught in the Context condition. 
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EXPERIMENT 2 

A second experiment was undertaken to assess the general i zabi 1 s ty 
of the results obtained in Experiment 1 to below-average readers. Spe- 
cifically, the study sought to examine the relative effectiveness of the 
three instructional procedures when they were applied to children who., 
were remed i a 1 readers . 

Method 

Subjects 

The subjects (N = 6) were four fourth and fifth grade females and 
two sixth grade males, ages 10-13, all of whom were classified as learning 
disabled and were receiving reading instruction -from a special education 
resource teacher. The school district classifies as learning disabled 
those students who test in the normal range on the WISC-R bnd whose per- 
formance is -below average in some academic area; Their primary measure 
to discriminate learning disabled from . non-hand i capped students is per- 
formance on the Developmental Test of Visual-Motor Integration (Beery & 
Buktenica, 1967). In accordance with school di stri ct policy, students' 
scores on^ttiese measures were not available to the experimenters. However, 
scores on the Stanford Achievement Test (1970) indicated students' reading 
comprehension as measured by the Paragraph Meaning subtest ranged from 
1.1 to 2.6 years below grade level. 

The Economy Keys to Reading program (1972) was used in both classroom 
and resource room instruction. At the time of the experiment, students 
were receivi^ng instruction in one of the third through sixth grade texts. 
Instructional placement in a particular text was buSed on students' 
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performance on a cr i tarion-referenced , curriculum-based assessment. Daily 
instruction for these students Included approximately 20 minutes of one-to- 
one oral reading to the resource room teacher followed by oral or written 
responses to factual comprehension questions drawn from the reading passages 
Students had participated in this type of instruction for approximately 
seven months prior to the experiment. The experiment was conducted in 
the resource room. All procedures and qiaterials were identical to those 
used in Experiment 1. / 

Resul ts 

Statistical analyses were identical to \those of the- first experiment. 
A total v/as calculated for each student (N = 6) under each treatment con- 
dition for the six dependent measures. Scores for individual students 
ranged from 0 to 6, out of a possible total olF six. Pre and posttest 
scores v/ere ancV/zed in a repeated measures randomized block factorial 
df.sFcn. Separate 2(Tests) x ^(Treatments) analyses of variance v;ere per- 
fv.rmed .fcv:!i dependent variable. Posttest means and standard deviations 
are dis: '.yeJ in Table 2. 

Insert Table 2 about here 

Significant overall Test, _F(l,35) > 8.^, £< .01, Treatment, £(3,35) > 
5.1, £< .01, and Test by Treatment interaction, £(3,35) > 7.6, £ < .01 , 
effects were observed on all dependent measures except the two delayed 
tests of vocabulary knowledge. There were no significant Test, Tre<itment, 
or Test x Treatrtient interaction effects on the Isolated Word-Delayed Test. 
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On the Multiple Choice-Delayed Test, only the Text x Treatment interaction 
F(3,35) = 3.92, p < .05, was significant. > 

Tests for simple effects revealed no significant pretest differences 
among Treatment groups, F^(3,35) < .81, £ > .10, on any dependent variable. 
In contrast, posttest differences among Treatment groups, F(3,^5) > 12.0, 
£< .01, were highly significant on the four dependent measures given 
immediately after training. Posttest differences were signi fi cant .on the 
Multiple Choice-Delayed Test, F^(3,35) = 5-38, £< .05, but not significant 
on the Isolated Word-Delayed Test, F(3,35) = 1.88, £ > .05. 

Pre to posttest changes were next examined for each treatment group 
each dependent variable. No significant gains were observed for either 
the Control or Context condition on any dependent measure, IF(l,35) < 2.8^, 
£> .05. The Given condition produced significant gains on the Isolated 
Word, F(l,35) =^.86, £< .05, and the Multiple Choice Test, F(l,35) = 8. i 
£ < .01. No significant pre to posttest gains were observed for the Given 
condition on any other measure. With the exception of the Isolated Word- 
Delayed Test, on which no sip[Rificant pre to posttest differences were 
observed, the Practiced condition produced significant pre to posttest 
gains on all dependent measures, £(l,35) > 9.72, £< .01. 

Tukey's HSD Multiple Comparison Test of pairs of treatments indicated 
that, in general, only the Practiced means differed significantly from 
all other treatment means'at the .01 confidenr-^; level across dependent 
measures. Exceptions occurred on the Multiple Choice-Delayfed Test where 
Practice exceeded the Control means at only the .05 conf idence ,^level , and 
on the Isolated Word-Delayed measure, where no significant differences 
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were observed. Given means differed significantly from Context, £ < .05, 
on only the two immediate vocabulary measures and the Sentence Paraphrase 
Test. Significant differences between Given and Control means were 
observed only on the Isolated Word Test, £ < .01, and on the Multiple 
Choice Test, £ < .05. All other mean differences on all dependent measures 
were nonsignificant. 

Pi scuss ion 

For the learning disabled readers, the^Practicfe condition appeared 
to be the most effective Instructional procedure for teaching synonyms. 
The Given condition also produced significant effect '^^».abu1ary 
measures, but its effects were far weaker than thoz. produced by Practice. 
In fact, performance under Practice exceeded performance under Given by a 
factor of 3.6 on the Isolated Word measure and by a factor of 2 oo: the 
MultipMe Choice measure. Context produced no appreciable vocabulary- 
learn ing-wi-t4i-^th i s- sampjejpf learners . The strongest procedures In Experi- 
ment 1 were also the strongest in Experiment 2. The weakest procedure in 
Experiment 1 (Context) was not at all effective with the learning disabled 
students'. Compared with the non-handicapped sample of Experiment 1, the 
learnjng disabled sample appeared to acquire fewer synonyms under each 
treatment condition." This was true whether or not students W6re merely 
presented with the synonyms (as In the Context and Given conditions) or 
brought to cril^erlon (as in the Practice condition). 

The findings with regard to sentence comprehension were similar to 

thos^ on the vofcdbulary measures. The Practice condition produced the 

\ 
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best sentence comprehension and the Context condition the least. Taking 
both experiments together, there appeared to be rather direct transfer 
^from synonym acquisition to sentence comprehension. Where vocabulary 
training produced weak effe'cts on vocabulary acq'jSsition (the Context 
condition in Experiment 1 and the Given condition in Experiment 2) there 
was little or :'no. transfer to sentence comprehension (to Sentence Paraphrase 
and Sentence Anonjaly, respectively). Retention for the learning disabled 
sample was depressed compared with the normal sample. Indeed, with the 
more stringent measure. Isolated Word-Delayed Test, there was virtually 
a complete loss of training effects. When all measures are considered 
across both experiments what appears to be an interaction of treatments 
with learner type may be essent i al ly maih effects for learner type and 
directn\ss of instruction. That is, all students benefited increasingly 
from increased amount of direct instruction, and normal students required 
less direct instruction than learning disabled students. 

EXPERIMENT 3 

The data gathered in the preceding experiments suggests that increased 
amounts of direct instruction faciiitates acquisition of vocabulary, and 
that acquisitiorl of isolated vocabulary transfers positively to sentence 
comprehens ion. The next experiment was designed to learn whether vocab'J- . 



lary training improves "passage ^ comprehens ion . As noted in the introduction, 

x.^ — • ■ . 

pr^^ious attempts to obtain transfer to connected discourse have failed. 

One ekp^lanation for these failures may be that rather weak vocabulary 

training procedures were employed* Thus, it seemed important to select 
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a strong vocabulary training procedure for the present experiment. Since 
the Meanings from Context and Meanings Given conditions were distinctly 
Inferior to the Meanings Practiced condition in the previous two experi- 
ments, a decision was made to employ this last treatment so as to maximize 
the chances of obtaining transfer effects to prose. A standard transfer 
research paradigm was chosen in which performance oh the transfer task, 
reading comprehension, was compared with and without prior appropriate 
training. 

Method 

Subjects 

Students participating in this study were ten fourth graders, six 
male and four female, ages 10-12, who were attending a summer school pro- 
gram for children of economically deprived families. Score;* on the 
Metropol i tan Achievement Test Indicated students' vocabulary scores were 
below grade level; the Word Meani ng subtest ranged from 1.7 to 5.8 with 
a median of 3.2. Their comprehension was also deficient; Reading subtest 
scores ranged from 1.6 to ^.5 with a median of 2.7- No student coulcj^ 
correctly define any of the 2^ experimental words. 

Pes! gn 

A within subjects design was employed in which studentsyseV^^^d-as— 
their own controls. The 2^ target words ^rom the previous experiments 
were divided into two l2-word sets. Two groups of students were randomly 
formed 50 that for one group of students, one set of words served as 
rnftructional words and the other set as control words.- For the other 
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group of students, the opposite sets of words were assigned to ins'truc- 
tional vs. control conditions. AM students were pre and posttested on 
both sets of words. 

« ■ * ■ ,. 

■ • ^ ' ' \ ' -y 

Treatment Conditions 



In each of the two conditions, all 24 words were printed onMn^dex 
cards. 'Students were grouped Into sets of two or three for instruction.. 
They received practice with the index cards until everyone could read 
the words with, 100% accuracy. After reaching this cri ter ion students 
began training on one of the two sets of target words. 

Synonym Instruction . This experimental procedure was similar to the 
Practice condition in Experiments 1 and 2. An experimenter showed a printed 
target word, said the word and a I-word synonym, then stated a sample 
sentence containing the word. Each student individual ly stated the word 
and its synonym; then the group repeated them in unison. For example. 

Experimenter shows 

printed ;^*Td and says: ''Debris means trash. After the picnic, 

we put the debris in the garbage cans." 

Student I says: -"Debris means trash." ' ? ' ? ' 

Stutfent 2 says: "Debris means trash." 

Student 3 says: "Debris means trash." 

All students say: "Debris means trash." 

After three words had been presented, the cards were shuffled and 
the group practiced that subset until each student could provide correct 

synonyms for all three words on one trial. Then another 3-word §ubset was 
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Introduced and practiced to criterion in the same manner. Next, these 
two subsets were combined and practiced until each student could give 
correct synonyms for all six words. Then the remaining six words were 
taught in the same way. Finally, all 12 experimental words were combined, 
and then students were given individual test trials. Practice was termina- 
ted when each student gave correct synonyms for one complete trial. 

No Instruction Control . Students read only the target word. No 
synonym instruction was provided. 

Dependent Measures 

Seven tests were used; two assessed vocabulary knowledge, two assessed 
sentence comprehension, and three assessed paragraph comprehension. The 
Isolated Word and Multiple Choice ^Vocabulary tests used in Experiments 1 
and 2 were again used to assess acquisition of word meanings. The Multiple 
Choice Test was restructured so that the three di. tractors for each word 
were synonyms of other randomly selected target words from the same 12-word 
set. All distractors were of the same part of speech as the correct 
synonym. SeYitence comprehension was measured by the Sentence Paraphrase 
and Sentence Anomaly tests used in Experiments 1 and 2. 

Two stories were written,' each of approximately 170 words. Each con- 
tained one of the two 12-word sets of target words. Readability leva s 
were In the fifth to s ixth grade range as determined by the Dale-Chal' 
Readabi l i ty , formula. 

Three sets, of tests were constructed to measure students *' comprehens ion 
of these paragraphs. Students individually completed all tests with an 
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experimenter who corrected reading errors when necessary, and recorded 

all responses on a separate test copy* All students received individual 

Instruction on each task prior to test administration. 
. ■ ' * i . . • ■ ^- 

Cloze Test . Five typed forms of a Cloze Test were derived for each 

Story. The first sentence of the paragraph remained intact. Beginning 

with either the first, second, third, fourth or fifth word in the second 

sentence, every subsequent fifth word was deleted and replaced by a 20- 

space blank. Students orally read one randomly assigned cloze form of 

the story and supplied as many deleted words as possible, which the examiner 

wrote down. Cloze Test responses were scored two ways, one with exact 

word replacements., the other with semantical ly acceptable replacements. 

For each scoring,. a percent was computed. 

Story Retelf Test .^xAfter they finished reading an intact copy of a 
story, the students were asked to tell everything they could remember about 
the story. An initial prompt, 'This story was about . . was used when 
necessary. The only other prompt, "Anything else?*' was used when a student 
had paused for several seconds. Retells were tape recorded, and scored 
later. A proppsi tional analysis (Kintsch, 197^) was made of each story. 
The number of correct and incorrect propositions recalled was. computed on 
each recall protocol. 

Comprehension Questions . An experimenter orally asked ten factual 
questions about each paragraph. Questions were directed at, story details 
which contained the experimental words. Examples of comprehension ques- 
t ibns include: 
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What did BMl always act like 'in^ cl.asst ' 
(a buffoon ) ' %^ ' ^ 

Wh^it was Bill pretending to do during arithmetic class? . 

( vanquish a horde of elephants) * • 

■ . " ' ' ^ , r 

Students' answers were tape recorded and scored la:ter/ Each correct^ly 

answered comprehension question was awarded one^point, yielding a po/sible 

total of ten per story. " . " 

Procedure - ^ 

Students came to the experimental room for one-half h6ur daily. On . 
the two days prior to Instruction, students Individually completed four 
pretests: the Isolated Word and Multiple Choice Vocabulary. tcists;, and. 
the Sentence Paraphrase and Sentence Anomaly Tests.' Once fn'struction 
began, three groups of students required only two instructional sessions . 
to reach criterion on their 12-word instructional seti ^ The fourth group 
of students required an additional two days of instruction. Individual- 
posttesting began for students the day after their instructional group 
reached criterion. First they completed a randomly assigned form of the 
Cloze Test containing their 12 experimental words, then* the entire - • 
Isolated Word Vocabular'y Test, and finally one-half of the- Sentence Para- 
phrase Test containing the experimental words. The next day, each student 
read the Intact experimental story, performed the Story Retell,. and 
answered the Comprehens iori Questions. In the time remaining, students 
completed as many Sentence Anomaly posttest Items as possible. 

On the following day, students completed a randomly sr'ected form 
of the Cloze Test for the Control story. They then finished the Sentence 
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Paraphrase Test for the Control words as well as any remaining Sen- 
. ten ce Anomaly items. On the final day, students read the intact Control 
story, p-^rformed the Story Retell, and answered the comprehension questions. 
Finally, they completed the entire Multiple Choice Vocabulary Test* 

Resul ts 

A total score was calculated for each student" (N = 10) under each 
treatment condition for the .seven dependent measures. Performance on experi 
mental and control passages was compared via _t tests for dependent samples. 
Posttest means and standard deviations are displayed in Table 3- 

Insert Table 3 about here 

Resul ts indicated significant differences between control and experi- 
mental means on all vocabulary and sentence comprehension measures. Mean 
scores were significantly different on the Isolated Word Test, t^(9) = 88.5, 
£< .01, the Multiple Choice Vocabulary Test,'_t(9) = 31.3, £ .01, and 
the Sentence Anomaly Test, t(9) = 18.0, £< .OK 

However, inspection of the results related to reading conrprehens I on 

• ' , . ■ - .? 

yields a distinctly different impression. Experimental and control treat- 
ments produced s i-gni f i^cant differences on Comprehiension Questions , t_(9) = 
^♦.8, £<..01, but not on Story Retell, t_(9) = 1.6, £> .25, or op Cloze, 
^^_(9) < 1, regardless of the scoring procedure. 

Pi scuss ipri ' 

Synonym practice again prove'd^ to be a highly effective procedure for 
vocabulary teaching, and one that resulted in positive transfer to sentence 
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"^comprehension. Indeed, the effects were striking, with students achieving 
nearly perfect performance on both the vocabulary and the sentence measures 
While there was some variation in the instructional tirrie required for 
students to learn the synonyms, most of the students acquired 12 new 
vocabulary words in under one hour, and were able to demonstrate their 
knowledge of these words one day later. It is notewprthy that all of 
these students were considered to be disabled readers by their schools. 

The results of vocabulary instruction on story comprehension are some- 
what perplexing. Students did not appear to benefit from their vocabulary 
knowledge advantage when they read connected discourse. Neither of the 
more general measures of reading' comprehension , Cloze and Story Retell, 
favored the experimental treatment. The fact that students answered more 
comprehension questions after receiving vocabulary training might be con- 
sidered a trivial finding since the answers to, ^questions were themselves 
target -.-lords. Thus, the observed facllitatfpn is potentially accounted 
for by word familiarization effects rather than by knowledge or meaning 
effects (Murray & Gi riooly, 196/). - 

GENERAL DISCUSSION 
Results of these experiments have demonstrated the differential ^ 
effectiveness of the three methods of vocabulary i nstf'Uctibn. In general, 
both average and disabled readers learned and retained the greatest number 
of vocabulary words by a Practice method of Instruction. Students learned 
fewer word meanings when instructors simply told them synonyms; they 
learned the least number of new word meanings when synonyms were presented 
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in context. As noted earlier, the relative efficacy of the instructional 
techniques reflected the differential amounts of direct instruction pro- 
vided. The greatest amount of direct instruction occurred in the Practice 
condition which produced the greatest amount of synonym acquisition; the 
least amount of direct instruction was provided -in-Context, the least 
effective procedure. The amount of time given to each instructional pro- 
cedure is^jof course, confounded with the direct instruction variable. 
Clearly/ Practice required the most instructional time and Context the 
least. However, the time difference between the Given and Context pro- 
cedures was negligible, and differences in vocabulary learning ,]Cannpt be 
accounted for by this factor. 

Learner type also appeared to be an important factor. Compared to 
'^normal" readers, learning disabled youngsters required more direct instruc 
tion before they evidenced significant vocabulary growth. Whereas the 
normal readers benefited somewhat from the Context trec»tments, but rather 
substantival ly from the Given treatment, the learning disabled students 
were unaffected by the Context treatment and only minimally affected by 
the Given treatment, Th^se results replicate those obtained in earlier 
research with learning disabled students (Pany & Jenkins, 1978), 

Word meaning knowledge did affect students' comprehension of sen- 
tences; comprehension of story was also affected when questions were 
directed at sentences containing the target words. However, vocabulary 
instruction apparently failed to influence more global' comprehension as 
assessed by cfoze and Retell, _ 

] 
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Explanations for this failure to affect reading comprehension fall 
into two categories. The first might be thought of as "problems with 
instructional methodology." Possibly, the increased task demands involved 
in comprehending connected discourse require greater vocabulary facility 
than that produced by the instructional procedui;es employed in the present 
study. The Practice condition was sufficient in helping students compre- 
hend sentences when the students were permitted to study the sentences 
one' at a time; both sentence measures demanded only processing of single 
unrelated sentences. This task may not demand the speed of lexical access 
that is required with more normal reading materials, e.g. stories. To 
help students comprehend longer, more natural discourse selections, however, 
a different type of vocabulary training may be ne^eded, one that guarantees 
rapid or automatic lexical access (LaBerge S Samuels, 197^; Perfetti S 
Lesgold, in press). Perhaps a vocabulary training procedure that went 
beyond an accuracy criterion, emphasizing speeded synonym retrieval , might 
be effective iri faci 1 itating comprehension, 

A second category of explanation for the observed, nonfaci 1 i tat ion of 
reading comprehension involves the contribution of vocabulary to reading 
comprehension. Pdthaps the presumed importance of vocabulary knowledge 

0 

(Becker, 1977) has been somewhat overestimated. It may be that readers 
can tolerate an unexpectedly high proportion of unfamiliar words without 
suffering comprehension losses. This explanatioh would seem particularly 
plausible if the reading passages are ones for which students already 
i possess well developed knowledge structures or schemata. When faced with 

30 ; . 
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passages based on familiar themes, perhaps readers need only to detect 
sufficient fragments of 1 nformat Ion' to recognize the theme. From this,, 
they then construct the authors' intended ineanings based on their own 
"knowledge recipes''' or schemata (Anderson, 1976). In any case, the 
presence of unfamiliar words in the current passages may not have resul- 
ted in the intended disruption of comprehension. Research which addresses 
the effects of varying densities of unfamiliar words would be useful.^ 

The findings from the three experiments may have i mpl i cati ons ' for 
instructional practice. Often, teachers devote some time to introducing 
^new vocabulary prior to assigning a reading selection. If the primary 
intent of this practice is to facilitate their students' comprehension of 
the forthcoming selection, there may be cause to reexamine this assumption . 
If the intent of this practice is to help students acouire new vocabulary, 
then it may be wise to consider a di rect instruction format for the vo- 
cabulary teaching, especially if the students are unsophi sticated ^or 
disabled readers. 
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'no student's reading accuracy fell below the predetermined criterion 

of 75% correct words. 

2 ' " 

This figure represents the time required tb( teach four vocabulary 

words, only two of which were target words. See Treatment Conditions , 

Meanings Practiced. 



Table 1 , . _ 

Posttest Heans and Standard Deviations for the Number Correct on Each Dependent Variable 
Under Each Treatment Condition for Averaget^tudervts 
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Note. Six is the highest score obtainable. 
^Standard Deviations 



Table 2 

Posttest Means and Standard Deviations for the Number Correct on Each Dependent Variable 
Under Each Treatment Condition for Learning Disabled Students 



No meanings control Meanings from context Meanings given 
Mean Mean Mean 



Meanings practiced 
Mean 
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Note . Six is the highest score obtainable. 
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Table 3 • 
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Pqsttest Means and Standard. Deviations for all Dependent Measures Under Both Treatment Conditions 



No instruction control Synonym instruction 
Highest possible score Mean Mean * 



so la ted Word Vocabulary Test 12 ,1 11.9 

(.32)' ^ (.32) 

jltiple Choice Vocabulary Test 12 .9 ■ 11.8 

(1.10) (.63) 

sntence Anomaly Test ^12 .5 11.0 

■ (.52)' d.56) 

' # f ■ ' 

;ntence Paraphrase Test 12 - .6 ' 11. A 

(.87) ■ (1.07) 

loze-exact Scoring • ,100% 33.7% ' 33.3% " 

" . (13.19) (-13. 03) 

1 oze-mean I ng Scoring . 100% 53.3% ' 55.0% 

... (20.72)' i (16.70) 

' ■ . / i / 

tory Retell 100% . _ 21.7% 28.5% 

(9.73) ■ (9.10) 

jmprehens ion (iiJest ions 100% ' 4.20% 7.40% 

^ ^ * (1.48) . (1.78) 
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